The Annals of Probability 
2009, Vol. 37, No. 6, 2332-2356 
DOI: 10.1214/09-AOP471 

© Institute of Mathematical Statistics, 2009 

CONTACT PROCESSES ON RANDOM GRAPHS WITH POWER 
LAW DEGREE DISTRIBUTIONS HAVE CRITICAL VALUE 

By Shirshendu Chatterjee and Rick Durrett 1 

Cornell University 

If we consider the contact process with infection rate A on a ran- 
dom graph on n vertices with power law degree distributions, mean 
field calculations suggest that the critical value A c of the infection 
rate is positive if the power a > 3. Physicists seem to regard this as 
an established fact, since the result has recently been generalized to 
bipartite graphs by Gomez-Gardenes et al. [Proc. Natl. Acad. Sci. 
USA 105 (2008) 1399-1404]. Here, we show that the critical value A c 
is zero for any value of ct > 3, and the contact process starting from 
all vertices infected, with a probability tending to 1 as n — » oo, main- 
tains a positive density of infected sites for time at least exp(n 1- ' 5 ) 
for any 5 > 0. Using the last result, together with the contact process 
duality, we can establish the existence of a quasi-stationary distribu- 
tion in which a randomly chosen vertex is occupied with probability 
p(A). It is expected that p(A) ~ CX 13 as A — » 0. Here we show that 
a — 1 < f3 <2a — 3, and so (3 > 2 for a > 3. Thus even though the 
graph is locally tree- like, /3 does not take the mean field critical value 
0=1. 

1. Introduction. In this paper we will study the contact process on ran- 
dom graphs with a power-law degree distribution, i.e., for some constant a, 
the degree of a typical vertex is k with probability pk ~ Ck~ a as k — > oo. 
Following Newman, Strogatz and Watts (2001, 2002), we construct the ran- 
dom graph G n on the vertex set {1,2,..., n} having degree distribution p = 
{pk '■ k > 0} as follows. Let d\, . . . ,d n be independent and have the distribu- 
tion P(di = k) =pk- We condition on the event E n = {d\ + • — h d n is even} 
to have a valid degree sequence. As P(E n ) — > 1/2 as n — > oo, the conditioning 
will have a little effect on the distribution of d^s. Having chosen the degree 
sequence (di,c?2, ■ • ■ ,d n ), we allocate di many half-edges to the vertex i, and 
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then pair those half-edges at random. We also condition on the event that 
the graph is simple, i.e., it neither contains any self- loop at some vertex, 
nor contains multiple edges between two vertices. It can be shown [see e.g. 
Theorem 3.1.2 of Durrett (2007)] that if the degree distribution p has finite 
second moment, i.e., if a > 3, the probability of the event that G n is simple 
has a positive limit as n — > oo, and hence the conditioning on this event will 
not have much effect on the distribution of d^s. 

We will be concerned with epidemics that take place on these random 
graphs. First consider the SIR (susceptible- infected-removed) model, in which 
sites begin as susceptible, and after being infected they get removed, i.e., 
become immune to further infection. In the simplest discrete-time formu- 
lation, an infected site x at time n will always be removed at time n + 1 
and for each susceptible neighbor y at any time n, x will cause y to become 
infected at time n + 1 with probability p, with all of the infection events 
being independent. 

In this case the spreading of the epidemic is equivalent to percolation. To 
compute the threshold p c for a large, i.e., 0(n), epidemic to occur with pos- 
itive probability, one notes that for a randomly chosen vertex x, the number 
of vertices at distance m from x, Z m , is approximately a two-phase branch- 
ing process in which the number of first generation children has distribution 
p, but in the second and subsequent generations the offspring distribution 
is the size biased distribution q = {qk : k > 0} satisfying 

(1.1) g fc _i = — , where \i = 

This occurs because vertices with degree k are k times as likely to be chosen 
for connections, and the edge that brings us to the new vertex uses up one 
of its degrees. For more details on this and the facts that we will quote in 
the next paragraph, see Chapter 3 of Durrett (2007). 

With the above observation in hand, it is easy to compute the critical 
threshold for the SIR model. Let v be the mean of the size biased distribu- 
tion, 

(1.2) f = 5>? fc . 

k 

Suppose we start the infection at a randomly chosen vertex x. Now if Y m 
is the number of sites at distance m from x that become infected, then 
EY m = p[i{pv) m ~ l . So the epidemic is supercritical if and only if p > I /v. 
In particular, if pk ~ Ck~ a as k — > oo and a < 3, then v = oo and p c = 0. 
Conversely if a > 3 then v < oo and p c = 1/v > 0. Hence for the SIR epidemic 
model on the random graph G n with power-law degree distribution, there 
is a positive threshold for the infection to survive if and only if the power 
a > 3. 
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We will study the continuous-time SIS (susceptible-infected-susceptible) 
model and show that its behavior differs from that of the SIR model. In 
the SIS model, at any time t each site x is either infected or healthy (but 
susceptible). We often refer to the infected sites as occupied, and the healthy 
sites as vacant. We define the functions {Q - t > 0} on the vertex set so that 
Ct( x ) equals or 1 depending on whether the site x is healthy or infected 
at time t. An infected site becomes healthy at rate 1 independent of other 
sites and is again susceptible to the disease, while a susceptible site becomes 
infected at a rate A times the number of its infected neighbors. Harris (1974) 
introduced this model on the d-dimensional integer lattice and named it the 
contact process. See Liggett (1999) for an account of most of the known 
results. We will make extensive use of the self-duality property property of 
this process. If we let £ 4 = {x:Q(x) = 1} to be the set of infected sites at 
time t, we obtain a set-valued process. If we write £f to denote the process 
with £q = A, then the self-duality property says that 



for any two subsets A and B of vertices. 

Pastor-Satorras and Vespignani (2001a, 2001b, 2002) have made an ex- 
tensive study of this model using mean- field methods. Their nonrigorous 
computations suggest the following conjectures about A c the threshold for 
"prolonged persistence" of the contact process: 

• If a < 3, then A c = 0. 

• If 3 < a < 4, then A c > but the critical exponent (3, which controls the 
rate at which the equilibrium density of infected sites goes to 0, satisfies 



• If a > 4, then A c > and the equilibrium density ~ C(A — A c ) as A J, A c , 
i.e. the critical exponent (3 = 1. 

Notice that the conjectured behavior of A c for the SIS model parallels the 
results for p c in the SIR model quoted above. 

Gomez-Gardehes et al. (2008) have recently extended this calculation to 
the bipartite case, which they think of as a social network of sexual contacts 
between men and women. They define the polynomial decay rates for degrees 
in the two sexes to be jm and jf, and argue that the epidemic is supercritical 
when the transmission rates for the two sexes satisfy 



where the angle brackets indicate expected value and k is shorthand for the 
degree distribution. Here A c is positive when jm,1f > 3. 

Our first goal is to show that A c = for all a > 3. Our proof starts with 
the following observation due to Berger et al. (2005). Here, we follow the 
formulation in Lemma 4.8.2 of Durrett (2007). 



(1.3) 



p>l. 
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Lemma 1.1. Suppose G is a star graph with center and leaves 1,2, ... ,k. 
Let At be the set of vertices infected in the contact process at time t when 
A = {0}. IfkX 2 oo, then P(A cxp{kX 2 /10) ^0)^1. 

Based on results for the contact process on (Zmodn) by Durrett and Liu 
(1988) and Durrett and Schonmann (1988), and on (Zmodn)'' by Mountford 
(1993), it is natural to conjecture that in the contact process on G n , with 
probability tending to 1 as re — > oo, the infection survives for time > exp(cre) 
for some constant c. It certainly cannot last longer, because the total number 
of edges is 0(n), and so even if all sites are occupied at time 0, there is a 
constant c so that with probability > exp(— cn) all sites will be vacant at 
time 1. Our next result falls a little short of that goal. 

Theorem 1. Consider a Newman, Strogatz and Watts random graphs 
G n on the vertex set {1,2, ... ,n}, where the degrees di satisfy P(di = k) ~ 
Ck~ a as k — > oo for some constant C and some a > 3, and P(di < 2) = 0. 
Let {£j : t > 0} denote the contact process on the random graph G n starting 
from all sites occupied, i.e., £q = {1,2, . . . ,n} . Then for any value of the 
infection rate A > 0, there is a positive constant p(\) so that for any 5 > 

inf p(—>p(\))^l asn^oo. 

t<exp(n 1 -' 5 ) \ n J 

One could assume that v > 1 and look at the process on the giant compo- 
nent, but we would rather avoid this complication. The assumption P[di < 
2) = is convenient, because it implies the following. 

Lemma 1.2. Consider a Newman, Strogatz and Watts graphs, G n , on n 
vertices, where the degrees of the vertices, di, satisfy P(di < 2) = 0, and the 
mean of the size biased degree distribution v < oo. Then 

P(G n is connected) — > 1 as oo, 

and if D n is the diameter of G n , 

P(D n > (1 + e) log nj log v) — > for any e > 0. 

The size of the giant component in the graph is given by the nonextinction 
probability of the two-phase branching process, so P{di < 2) = is needed 
to have the size ~ n. Intuitively, Lemma 1.2 is obvious because the worst 
case is the random 3-regular graph, and in this case, the graph is not only 
connected and has diameter ~ (log re)/ (log 2), see Sections 7.6 and 10.3 of 
Bollobas (2001), but the probability of a Hamiltonian cycle tends to 1, see 
Section 9.3 of Janson, Luczak, and Rucihski (2000). We have not been able 
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to find a proof of Lemma 1.2 in the literature, so we give one in Section 5. 
By comparing the growth of the cluster with a branching process it is easy 
to show P(D n < (1 — e) logra/ log v) — > for any e > 0. 

In a sense the main consequence of Theorem 1 is not new. Berger et 
al. (2005), see also (2009), show that A c = for a generalization of the 
Barabasi-Albert model in which each new point has m edges which are 
with probability (3 connected to a vertex chosen uniformly at random and 
with probability 1 — [3 to a vertex chosen with probability proportional to 
its degree. Theorem 2 in Cooper and Frieze (2003) shows that such graphs 
have power law degree distributions with a = l + 2/(l — (3), so these examples 
have a € [3, oo) and A c = 0. 

Having acknowledged the previous work of BBCS, it should be noted that 
(i) our result applies to a large class of power law graphs that have a different 
structure; and (ii) the BBCS proof yields a lower bound on the persistence 
time of exp(cn 1// ^° -1 ^) compared to our exp(n 1-5 ). Our improved bound on 
the survival times relies only on the power law degree distribution and the 
fact that the diameter is bounded by Clogn, so it also applies to graphs 
BBCS consider. 

Theorem 1 shows that the fraction of infected sites in the graph G n is 
bounded away from zero for a time longer than exp(n 1//2 ). So using self- 
duality we can now define a quasi-stationary measure ^ on the subsets 
of {1,2, ... ,n} as follows. For any subset of vertices A, P^lo D A ^ 0) = 
P(C^ p f n i/2\ )- Let X n be uniformly distributed on {1,2, . . . ,n} and let 

PnW = P(X n G do)- Berger et al. (2005) show that for the contact process 
on their preferential attachment graphs, there are positive, finite constants 
so that 

b\ C < p n {\) < B\ c . 

In contrast, we get reasonably good numerical bounds on the critical expo- 
nent. 

Theorem 2. Suppose a > 3. There is a Xq > so that if < A < Ao 
and < 6 < 1, then there exists two constants c(a,5) and C(a,5) so that as 
n — > oo 

p(cA l + (a- 2 )(2+<5) < pn{x) < cx i +(a - m ~8) ) ^ L 

When a is close to 3 and 5 is small, the powers in the lower and upper 
bounds are close to 3 and 2. The ratio of the two powers is < (2 + 5)/(l — 5) ~ 
2 when 5 is small. 

The intuition behind the lower bound is that if the infection starts from 
a vertex of degree d(x) > (10/A) 2+<5 , then it survives for a long time with a 
probability bounded away from 0. The density of such points is C , A^ 2+<5 - )( ' a_1 ^ , 
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but we can improve the bound to the one given by looking at neighbors of 
these vertices, which have density CA^ 2+<5 ^ a_2 ^ and will infect their large 
degree neighbor with probability > cA. 

For the upper bound we show that if m(a, 5) is large enough and the 
infection starts from a vertex x such that there is no vertex of degree > 
within distance m from x, then its survival is very unlikely. To get 
the extra factor of A we note that the first event must be a birth. Based on 
the proof of Lemma 1.1, we expect that survival is unlikely if there is no 
nearby vertex of degree > A~ 2 and hence the lower bound gives the critical 
exponent. 

It is natural to speculate that the density of the quasi-stationary measure 
Pn(A) — ► p(A) as n — > oo. By the heuristics for the computation of A c in the 
SIR model, it is natural to guess that, when a > 2, p(X) is the expected 
probability of weak survival for the contact process on a tree generated by 
the two-phase branching process, starting with the origin occupied. 

Here the phrase 'weak survival' refers to set of infected sites being not 
empty for all times, in contrast to 'strong survival' where the origin is re- 
infected infinitely often. As in the case of the contact process on the Bol- 
lobas-Chung small world studied by Durrett and Jung (2007), it is the weak 
survival critical value that is the threshold for prolonged persistence on the 
finite graph. 

Sketch of the proof of Theorem 1. The remainder of the paper is devoted 
to proofs. Let V£ be the set of vertices in the graph G n with degree at least 
n £ . We call the points in V£ stars. We say that a star of degree k is hot if 
at least Xk/4 of its neighbors are infected and is lit if at least Xk/10 of its 
neighbors are infected. Our first step, taken in Lemma 2.2, is to improve the 
proof of Lemma 1.1 to show that a hot star will remain lit for time exp(cn e ) 
with high probability. 

To keep the system going for a long time, we cannot rely on just one star. 
There are 0(ra 1— e ( a— 1 ') stars in this graph which has diameter O(logn). 
If one star goes out, presence of a lit star can make it hot again within a 
time 2n £//3 with probability at least n~ b . See Lemmas 2.3 and 2.4 for this. 
Lemma 2.6 shows that a lit star gets hot within 2exp(n £//3 ) units of time 
with probability 

> l-5exp(-AV /3 /16), 

and Lemma 2.5 shows that a hot star eventually succeeds to make a nonlit 
star hot within exp(n e / 2 ) units of time with probability 

> i _ 8e - A2n£ / 80 . 

Using these estimates, we can show that the number of lit stars dominates 
a random walk with a strong positive drift, and hence more than 3/4's of 
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the collection will stay lit for a time 0(exp(n 1_Q£ )). See Proposition 1 at 
the end of Section 2 for the argument. 

To get a lower bound on the density of infected sites, first we bound 
the probability of the event that the dual process, starting from a vertex 
of degree (10/A) 2+<5 , reaches more than 3/4's of the stars. We do this in 
two steps. In the first step (see Lemma 3.2) we get a lower bound for the 
probability of the dual process reaching one of the stars. To do this, we 
consider a chain of events in which we reach vertices with degree (10/A) fc+<5 
for k > 2 sequentially. In the second step (see Lemma 3.3) we again use 
a comparison with random walk to show that, with probability tending 
to 1, the dual process, starting from any lit star, will light up more than 
3/4's of the stars. Then we show that the above events are asymptotically 
uncorrelated, and use a second moment argument to complete the proof of 
Theorem 1 and the lower bound for the density in Theorem 2. 



Open Problem. Improve the bounds in Theorem 2 and extend the 
result to a > 1. 



When 2 < a < 3 the size biased distribution has infinite mean. Chung 
and Lu (2002, 2003) obtained bounds on the diameter in this case, and later 
van der Hofstadt, Hooghiemstra and Zamenski (2007) showed if H n is the 
distance between 1 and 2 then 



2 log log n 

-tin n ' 



■log(a-2) 

When 1 < a < 2 the size-biased distribution has infinite mass, van der 
Esker et al. (2005) have shown in this case 

lim P(H n = 2) = lim 1 - P(H n = 3) =p G (0, 1) 

n— >oo n— >oo 

so the graph is very small. 

All of the results about the persistence of infection at stars in Section 2 are 
valid for any a, since they only rely on properties of the contact process on 
a star graph and an upper bound on the diameter. The results in Section 3, 
rely on the existence of the size biased distribution and hence are restricted 
to a > 2. The proof of the lower bound should be extendible to that case, 
but the proof of the upper bound given in Section 4 relies heavily on the 
size-biased distribution having finite mean. When 1 < a < 2, the size-biased 
distribution does not exist and the situation changes drastically. We guess 
that in this case p n (A) = 0(A). 
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2. Persistence of infection at stars. Let e > and let V£ be the set of 

vertices in our graph G n with degree at least n £ . We call these vertices 
stars. We say that a vertex of degree k is hot if it has at least L = Afe/4 
infected neighbors and we call it lit if it has at least 0.4L = Xk/10 infected 
neighbors. We will show that if e is small, then in the contact process starting 
from all vertices occupied, most of the stars in V£ will remain lit for time 
0(exp(n 1 - aE )). 

We begin with a slight improvement of Lemma 1.1 which gives a numer- 
ical estimate of the failure probability, but before that we need two simple 
estimates. 

Lemma 2.1. 7/0<a;<a<l then e x < 1 + (l + a)x and e~ x < 1 - (1 - 
2o/3)x. 

Proof. Using the series expansion for e x 

^--tHiHs) 4 --) 

and summing the geometric series gives the result. □ 

Lemma 2.2. Let G be a star graph with center and leaves 1,2, ...,k. 
Let At be the set of vertices infected in the contact process at time t. Suppose 
A < 1 and X 2 k > 50. Let L = Xk/4 and let T = exp(/cA 2 /80)/4L. Let P L>i 
denote the probability when at time the center is at state i and L leaves 
are infected. Then 

Piifinf \A t \ <0AL) <7e~ x ' 2k/80 fori = 0,1. 

' \t<T ) 

Proof. Write the state of the system as (m,n) where m is the number 
of infected leaves and n = 1 if the center is infected and otherwise. To 
reduce to a one-dimensional chain, we will concentrate on the first coordi- 
nate. When the state is (m,0) with m > 0, the next event will occur after 
exponential time with mean \/{mX + m), and the probability that it will be 
the reinfection of the center is A/(A + 1). So the number of leaf infections N 
that will die while the center is has a shifted geometric distribution with 
success probability A/(A+ 1), i.e., 

P(ff-j) = (xtt) J ' XT! for ^°- 
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Let Nl be the realization of N when the state of the system is (L, 0). Then 
Nl will be more than 0.1L with probability 

(2.1) P L ,o(N L > 0.1L) < (1 + A)~ aiL < e~ AL / 20 = e"* 2 */ 80 . 

Here we use the inequality 1 + A > e A / 2 . If Nl < 0.1L, then there will be at 
least 0.9L infected leaves when the center is infected. 

The next step is to modify the chain so that the infection rate is when 
the number of infected leaves is L = Xk/4 or greater. In this case the number 
of infected leaves > Yt where 

at rate 

Y t - Y t - 1, Afc/4, 

Y t ->Y t + l, 3Xk/4 for Y t < L, 

Y t ->Y t - N, 1. 

To bound the survival time of this chain, we will estimate the probability 
that starting from 0.8L it will return to 0.4L before hitting L. During this 
time Yt is a random walk that jumps at rate Xk + 1. Let X be the change 
in the random walk in one step. Then 

with probability (Xk/4)/(Xk + 1), 
X = { +1, with probability (3Afc/4)/(Afc + 1), 
with probability l/(Xk + 1), 




and so 



tie = e ■ — • — + e 



3 Xk a 1 Xk 



4 Afc + 1 4 Xk + 1 



Xk + l^-t \X + l) A + l 



If e~ e /(A + 1) < 1, the third term on the right is 

A 1 



Xk + l 1 + A - 
If we pick 9 < so that e~ e = 1 + A/2, then 

^ ox Afe / 1 3 . , 1 2 \ 

Since 1/(1 + x) < 1 - x + x 2 for < x < 1, 

1 3 /n Wn , 1 2 1 / A A 2 \3 A 2 
iTA^-I + ^ + ^-I + Afc-^H + Tjl+S + Afc 

3A A ^_ 
<_ 16 + 8 + W 
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where in the last inequality, we have used A < 1. Since we have assumed 
X 2 k > 50, the right-hand side is < 0. 

To estimate the hitting probability we note that if (j)(x) = exp(8x) and 
Yq > 0.6L, then <f>(Yt) is a supermartingale until it hits L. Let q be the 
probability that Yt hits the interval (— oo,0.4L] before returning to L. Since 
9 < 0, we have <j>(x) > </>(0.4L) for x < 0.4L. So using the optional stopping 
theorem we have 

#(0.4L) + (1 - q)(f>(L) < 0(O.8L), 

which implies that 

q < (j){0.8L)/4>(0AL) = exp(0.46>L) < e"^ 40 , 

as e~ 9 = 1 + A/2 > e A / 4 when A/4 < 1/2 (sum the series for e x ). 

At this point we have estimated the probability that the chain started at 
a point > 0.8L will go to L before going below 0.4L. When the chain is at 
L, the time until the next jump is exponential with mean 1/(L + 1) > 1/2L. 
The probability that the jump takes us below 0.8L is (since 1 + A > e A / 2 ) 

< (i + X y°- 2L < e~ XL / 10 = e ~ x2k / i0 . 

Thus the probability that the chain fails to return to L, M = e A2fc / 80 times 
before going below 0.4L is 

< 2e- A2fe / 80 . 

Using Chebyshev's inequality on the sum, Sm of M exponentials with mean 
1 (and hence variance 1), 

P(S M < M/2) < 4/M. 

Multiplying by 1/2L we see that the total time, Tm of the first M excursions 
satisfies 

P(T M < M/4L) < 4e" A2fc/80 . 

Combining this with the previous estimate on the probability of having fewer 
than M returns and the error probability in (2.1) proves the desired result. 
□ 

Thus Lemma 2.2 shows that a hot star will remain lit for a long time 
with probability very close to 1. Our next step is to investigate the process 
of transferring the infection from one star to another. The first step in doing 
that is to estimate what happens when only the center of the star infected. 
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Lemma 2.3. Let G be a star graph with center and leaves 1,2, ...,k. 
Let < A < 1, 5 > and suppose X 2+S k > 10. Again let Pn denote the prob- 
ability when at time the center is in state i and I leaves are infected. Let 
tq be the first time becomes healthy, and let Tj be the first time the number 
of infected leaves equals j. If L = Afe/4, 7 = (5/(4 + 25), and K = Afc 1_7 /4, 
then for k > ko(S) 

Po,i(T K >T )<2/k\ 
P K ,l(T < T L ) < exp(-A 2 A: 1 -Vl6) < l/k\ 
E ,i(T L \T L < 00) < 2. 

Combining the first two inequalities Pq,\(Tl < 00) > 1 — 2/k" 1 , and using 
Markov's inequality, if we can infect a vertex of degree at least k such that 
k > ko(S) and \ 2+s k > 10, then with probability > 1 — 5/A; 7 the vertex gets 
hot within the next /c 7 units of time. 



Proof of Lemma 2.3. Note that to ~ exp(l), and for any t < To, the 
leaves independently becomes healthy at rate 1 and infected at rate A. Let 
Po(t) is the probability that leaf j is infected at time t when the central 
vertex of the star has remained infected for all times s <t. po{0) = and 

d ^ = -p (t) + (1 - P0 (t))X = A - (A + l)po(i). 

So solving gives p (t) = $ Ae~( A+1 )(*~ s ) ds = ^(1 - e" (A+1) *). From this it 
follows that 

(2.2) P ,i{T K < t ) > P(Binomial(A;,po(^" 7 )) > K)P(t > A;" 7 ). 
Now if k 1 > 8/3, (A + l)/c~ 7 < 3/4 and it follows from Lemma 2.1 that 

Po(k^) > Xk^/2. 
Writing p = Po(^~ 7 ) to simplify formulas, if 9 > 

P(Binomial(A;,p) < K) < e eK '(1 - p + pe~ e ) k . 
Since log(l + x) < x the right-hand side is 

<exp(^^^ + (e- e -l)^— 

Taking 6 = 1/2 and using Lemma 2.1 to conclude e _1//2 — 1 < —1/3, the 
above is 

< exp(-AA; 1 - 7 /24) < exp(-A; 1/2 ^ 7 /8), 
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since X 2 k > 9. Using this in (2.2), the right-hand side is 

> (1 - eM-k 1/2 ~ J /8))(l ~ k^) > 1 - 2/k\ 

if k 1 / 2 ^ > 87 log /c. 

Using the supermartingale from the proof of Lemma 2.2, if q = Pk, i(Tq < 
Tl), then we have 

q .l + (l- q )e dL <e dK , 

and so q < e eK < e~ XK ^. In the last step we have used e 9 = 1/(1 + A/2) < 
e _A / 4 , which comes from Lemma 2.1. Filling in the value of K, e~ XK ^ = 
e- A2fc1 " 7 / 16 . Now 

^2jfel-7 = {\ 2+5 kf^ 2+5) fcl-7-2/(2+«) > 10 2/(2+5) /c 5/(4+25) > 

So if fc <5 /(4+2«5) > 16 . io- 2 /( 2 +' 5 ) 7 logA;, then e ~ XK ^ < 1/Jfe?. 

To bound the time we use the lower bound random walk Yt from Lemma 
2.2. EN = 1/X, so 

,'AJfe 1\ fX 2 k-2\ 
EY f = — — — U = I — — — t. 



.2 A / V 2A 

Let be the hitting time of L for the random walk Yt- Using the optional 
stopping theorem for the martingale Y t — (X 2 k — 2)t/2X and the bounded 
stopping time T% At we get 

EY t y m - (^^) E(Tl At) = EY = 0. 
Since EY t y m < L = Xk/4, it follows that 

as by our assumption X 2 k > 4. Letting i — ► oo we have ET£ < 1. Since is 
a lower bound for the number of infected leaves, TlX\t l <oo\ — ^/T- Hence 



£ ,i(7l|T l <oo) 



_ -5'o,i( r Ll[r Z/ <oo]) 

P ,l( T L<°°) 

go.iTj 1 

- P ,i(^ < t )Pk,i(T l <T )-T/2 



for large fc. □ 



To transfer infection from one vertex to another we use the following 
lemma. 
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Lemma 2.4. Let vo,V\, . . . ,v m be a path in the graph and suppose that 
vq is infected at time 0. Then the probability that v m will become infected by 
time m is > (e _1 (l — e _A )e _1 ) m . 

Proof. The first factor is the probability that the infection at vq lasts 
for time 1, the second the probability that vq infects v\ by time 1, and the 
third the probability that the infection at v\ remains until time 1. Iterating 
this m times brings the infection from to m. □ 

When the diameter of the graph is < 2 log n, the probability in Lemma 2.4 
is > n~ b for some b E (1/2, oo), and the time required is < 21ogn. Combining 
this with Lemma 2.3 (with k = n £ and 7 = 1/3) shows that if n is large, 
then with probability > Cn~ b we can use one hot star to make another star 
hot within time 2n £ / 3 . Using Lemma 2.2 and trying repeatedly gives the 
following lemma. 

Lemma 2.5. Let s\ and S2 be two stars in V£ and suppose that s\ is 
hot at time 0. Then, for large n, S2 will be hot by time T = exp(n £ / 2 ) with 
probability 

> 1 _ 8e - A2 ™ £ /80_ 

Proof. If n is large, Lemma 2.2 shows that s\ remains lit for T units 
of time with probability > 1 — 7e~ x n£ / 80 . Let t n = 2n £ / 3 and consider the 

discrete time points t n , 2t n , At all of these time points we can think of a 

path starting from an infected neighbor of s± up to S2- Using one such path 
the infection gets transmitted to S2 and it gets hot in 2n £//3 units of time 
with probability > Cn~ b for some constant C. So s\ fails to make S2 hot by 
time T with probability 

< (1 - Cn- b ) T/tn < exp(-Cn- b T/t n ) < exp(-A 2 n £ /80) 

for large n. For the first inequality we use 1 — x < e~ x . Combining with the 
first error probability in this proof, we get the result. □ 

Next we show that a lit star becomes hot with a high probability, and 
then helps to make other nonlit stars lit. 

Lemma 2.6. Let s be a star of V£ and suppose that s is lit at time 0. 
Then s will be hot by time 2exp(n £//3 ) with probability 

> 1 — 5exp(— A 2 n £ / 3 /16) if n is large. 
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Proof. Since s is lit, it has at least An e /10 infected neighbors at time 0. 
If s itself is not infected at time 0, let N be the number of leaf infections that 
die out before s gets infected. Using similar argument as in the beginning 
of the proof of Lemma 2.2, 

/ 1 V A 
P(N = j)= t— r -TTT f°rj>0, 



A + iy A + l 
which implies 

P(N > Xn £ /20) < (1 + A)- AnE / 20 < e -* a »7« 

as 1 + A > e A / 2 by Lemma 2.1. Also the time Tm taken for M = Xn e /20 
leaf infections to die out is a sum of M exponentials with mean at most 
1/(A + 1)M < 1/M. Now if n 2 7 3 > 40/16, the above error probability is 

< e -A 2 n £ / 3 /16 

Using Chebyshev's inequality on the sum, Sm of M exponentials with 
mean 1 (and hence variance 1), we see that if exp(ra £ / 3 ) > 2, i.e., n £//3 > log 2 

1 



P{S M >Mexp(n e/3 )) < 



M(exp(n £ / 3 ) - l) 2 



^ Tf 7o~Tj3\ ^ exp(-A 2 n £ / 3 /16), 

M exp(2n £ / d j 

where in the final inequality we have used M > 4, i.e., n £ > 80/A, and 
A 2 /16<2. 

Multiplying by 1/M we see that the total time, Tm, satisfies 
P(Tm > exp(n £ / 3 ))<exp(-AV/ 3 /16). 

Combining these two error probabilities gives that s will be infected along 
with at least An £ /20 infected neighbors within exp(n £//3 ) units of time with 
error probability 

(2.3) <2exp(-AV /3 /16). 

Now An e /20 > A?i £//3 /4, when n 2e / 3 > 5. So if s is infected and has at least 
An £ /20 infected neighbors, then using the second inequality of Lemma 2.3 
(with 7 = 2/3 and k = n £ ), s becomes hot with error probability 

<exp(-A 2 n e/3 /16). 

Finally using Markov's inequality and the third inequality of Lemma 2.3, 
the time T s taken by s to get hot, after it became infected, is more than 
T = exp(n £ / 3 ) with probability 

< 2exp(-n e/3 ) < 2exp(-AV /3 /16), 
as A < 1. Combining all these error probabilities proves the lemma. □ 
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We now use Lemmas 2.5, 2.6 and 2.2 to prove that if the contact process 
starts from all sites infected, then for a long time at least 3/4's of the stars 
will be lit. 

Proposition 1. Let be the set of stars in V£ which are lit at time 
t in the contact process :t > 0} on G n . Let t n = 2exp(n £//2 ) and M n = 
exp(n 1_Q£ ). Then there is a stopping time T n such that T n > M n ■ t n and 

m^,Tj<(3/4)|y n £ |)<exp(-Cn £ ). 

PROOF. Let a n = \V£\- Clearly \I^ | = a n . We will estimate the proba- 
bility that starting from (7/8)a n lit stars, the number goes below (3/4)a n 
before reaching a n . Define the stopping times t^s' and <7js' as follows. Let 
r = °"0 = and for z > let 

T i+ i = inf{i >n + Oit n : \I* >t \ = (7/8)a n }, 

(Ji+i = min{s eN:|4 jT . +1+s . fn | £ ((3/4)a„, a n )}. 

We need to look at time lags that are multiples of t n in the definition of o~i 
because in our worst nightmare (which is undoubtedly a paranoid delusion) 
all the lit stars of degree k > n e at time Tj+i have exactly 0.1 A: infected 
neighbors. 

Lemma 2.6 implies that a lit star of V£ gets hot within time 2exp(n £ / 3 ) < 
exp(n e / 2 ) (for large n) with probability > 1 — 5exp(— A 2 n e / 3 /16). Combining 
with Lemma 2.2 gives that a lit star at time gets hot by time t n /2 and 
remains lit at time t n with probability > 1 — 6exp(— A 2 n e / 3 /16) for large n. 
Now if \Int\ < a n f° r some t, then the number of lit stars will increase at 
time t + t n with probability > P(A n B), where 

• A: All the lit stars will get hot by t n /2 units of time, and be lit for time 

tn ■ 

• B: A nonlit star will become hot by time t n /2 in presence of another hot 
star, and remain lit for another t n /2 units of time. 

Now using the above argument P(A) > 1 — 6nexp(— A 2 ?i £ / 3 /16), as there 
are at most n stars. Combining Lemmas 2.5 and 2.2 gives P(B) > 1 — 
9exp(— A 2 n £ /80). So P(Af)B) > 1 — exp(— n £ / 4 ) for large n. Using the stop- 
ping times \L n n+r .t n \ > W r for r < Ui, where {W r : r > 0} is a discrete time 
random walk satisfying 

W r — > W r — 1 with probability exp(— n £//4 ), 

(2-4) 

W r — > W r + 1 with probability 1 — exp(— n £/ ), 
and Wq = (7/8)a n . Now 9 Wr is a martingale where 
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If q is the probability that W r goes below (3/4)a n before hitting a n , then 
applying the optional stopping theorem 

q . 0(3/4)c*h + (1 - q) . e a ™ < 0( 7 / 8 ) a ™ , 

which implies 

g < (*n/8) < eX p(-Cn 1 -( Q! - 1 >), 

as a n ~ Cn 1 "^- 1 ) 6 for some constant C . So the probability that the random 
walk fails to return to a n at least M n = exp(n 1_Q£ ) times before going below 
(3/4)a„ is < exp(-Cn £ ). Now if 

K = min{i > 1 : \I* >n+(Ti . t J < (3/4)a n }, 

the coupling with the random walk will imply P(.K" < M n ) < exp(— Cn £ ), 
and hence for T n = T Mn + cr Mn • in 

P(|/^ T J<(3/4)|Kf|)<exp(-Cn £ ). 

As <Tj > 1 for all i, by our construction T n > M n • t n , and we get the result. 
□ 

So the infection persists for time longer than exp(n 1_Q?£ ) in the stars of 

v n • 

3. Density of infected stars. Proposition 1 implies that if the contact 
process starts with all vertices infected, most of the stars remain lit even 
after exp(n 1 ~ Q£ ) units of time. In this section we will show that the density 
of infected stars is bounded away from and we will find a lower bound for 
the density. We start with the following lemma about the growth of clusters 
in the random graph G n , when we expose the neighbors of a vertex one at a 
time. For more details on this procedure see Section 3.2 of Durrett (2007). 

Lemma 3.1. Suppose < 5 < 1/8. Let A be the event that the two clus- 
ters, starting from 1 and 2 respectively, intersect before their sizes grow to 
n s . Then 

P{A)<Cn~^ 4 - s \ 
Proof. If di, . . . ,d n are the degrees of the vertices, then 

for some constant c. Suppose all the degrees are at most n 3 ^ 2a ~ 2 \ Suppose 
R\ and R2 are the clusters starting from 1 and 2 up to size n 5 . Let B be the 
event that R± contains a vertex of degree > n l ^ 2a ~ 2 \ Let e n be the sum of 
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degrees of all those vertices with dee ree >n 1 /( 2 "" 2 ). While growing R± the 
probability that a vertex of degree > n 1 ^ 2 "" 2 ) will be included on any step 
is 

~ £?=i^-n 5 + 3 /( 2 "- 2 ) ~P»- 
Since the size biased distribution is g& ~ Ck~( a ~ 1 ^ as k — > oo, J2s>kQs ~ 

(jfe-(a-2) ag ^ an( j we jj ave g n ^ (j n l-(a-2)/(2a-2) an( j ^ ^ ^ 

£< n -(a-2)/(2a-2) ag n _^ g Q £ Qr i ar g e n f3 n < c\n~ 1 ^ for some constant 
ci, when a > 3. Thus 

P(£ c ) > 1 - ci/n 1 / 4 " 5 . 

If B c occurs, all the degrees of the vertices of R± are at most n l ^ 2a ~ 2 \ In 
that case, while growing R2 the probability of choosing one vertex from R\ 
is 

„<5+l/(2a-2) 
< U I < C9 /n 1 - 5 - 1 /(2a-2) 

Ya=i di - n 5+3 /( 2a -y 1 
So the conditional probability 

P{A C \B C ) > (1 - C2rl -(l-«5-l/(2«-2)))^ > j _ C2/ri l-25-l/( 2a -2)_ 

Hence combining these two 

P(A C ) > (1 - d/nV^Xi - cs/n 1 " 25 - 1 /^- 2 )) > 1 - Ci/n 1 / 4 "*, 
and that completes the proof. □ 

Lemma 3.1 will help us to show that in the dual contact process, staring 
from any vertex of degree > (10/A) 2+<5 for some 5 > 0, the infection reaches 
a star of V£, with probability bounded away from 0. 

Lemma 3.2. Let £^ fee i/ie contact process on G n starting from £q = A. 
Suppose < e < l/20(a — 1). T/ien i/iere are constants Xq > 0, no < oo, 
c o = c o(-^] £ ) an d Pi > independent of A < Ao, n>nQ and e smc/i i/iat i/ 
T = n c °, ^2 is o, vertex with degree d(v2) > (10/A) 2+<5 /or some < 5 < 1 and 
v\ is a neighbor of V2, 

P(# 2} n v*) > P2 , P(4 u l\ n v-) > piX. 

Proof. The second conclusion follows immediately from the first, since 
the probability that v\ will infect v% before time 1, and that V2 will stay 
infected until time 1 is 

> T ^(l-e-( A+1 V 1 >cA. 
~ X + V ' 
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Let A m be the set of vertices in G n of degree > (10/X) m+S for m>2. Define 
7 = 2(TfS) and 

B = 2(a — 1) log(10/A), u = (e^l - e^e -1 ) - ^"^, 
w n = log(n £ )/ log(10/A) -5, T m = + T 2 , 
where = {W/\)( m+6 ^, = u m , and we let n c ° = £™'l 2 T m . 

Define E^ = {£,t 2 2 ^ ^^3 7^ ^} an d for m > 3, having defined E2, ■ ■ ■ , E m —i, 
we let 

^™ = U£ m} nA m+1 /0} and Vm GCfc l} nA m . 

Let j4 m be the event that the clusters of size (10/A)( m+<5+1 )( Q ~ 2 ) starting 
from two neighbors of v m do not intersect and 

Wn 

m=2 

Since e < l/20(a - 1), the cluster size (io/A)( m+5+1 )( a - 2 ) is at most n 1 / 10 
for m < w n . So using Lemma 3.1 and the fact (fj) < k 2 , 



p{f c ) < [ ( 10 A) 2 — j cn 



^2m+25 \ rn -(1/4-1/10) 

\m=2 

< n^cn-^/ 4 " 1 / 10 ) < cn-( 1 /4-3/20) < 1/6 

for large n. 

Since each vertex has degree at least 3, if F occurs then by the choice 
of B the neighborhood of radius Bm around v m will contain more than 
^io^(m+5+i)(a-2)+m vertices. Let G m be the event that the neighborhood of 
radius Bm around v m intersects A m+ i. In the neighborhood of v m proba- 
bility of having a vertex of A m+ i is at least c(A/10)( m+l5+1 )( Q! ~ 2 ) . Hence 

P{G c m n F) < (1 - c(A /io)( m+5+1 )( Q " 2 ) )( 10 / A)m+(m+5+1)(a " 2) 

<exp(-(10/AD. 

If A is small, Em= 2 exp(-(10/A) m ) < 1/6. 

On the intersection of F and G m we have a vertex of A m+ i within radius 

Bm of v m . Using Lemmas 2.2 and 2.3, in the contact process {^ Vm ^ : t > 0}, 
v m gets hot at time and remains lit till time T m with error probability 
< cA( m+5 ^ 7 for small A. If v m is lit, then Lemma 2.4 shows that v m fails to 
transfer the infection to some vertex in A m +i within time with proba- 
bility 

< [1 _ ( e -i(l _ e -A )e -y m] T£/(B™) 

< expKe-^l - e- x )e- l )~ m /(Bm)] = Vm , 
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where = indicates we are making a definition, and hence P{E^ n G m F) < 
cA (m+5) 7 if \ i s sma ll ^»™ 2 [cA(™ +i ^ + j) ra ] < 1/6, we can take p 2 = 1/2 
and the proof is complete. □ 

Lemma 3.2 gives a lower bound on the probability that an infection start- 
ing from a neighbor of a vertex of degree > (10/A) 2+<5 reaches a star. Lemma 
2.3 shows that if the infection reaches a star, then with probability tending 
to 1 the star gets hot within re £ / 3 units of time. Combining these two we get 
the following. 

Proposition 2. Suppose < e < l/20(a — 1). There are constants X > 
0, no < oo ci = ci(A,e) and pi > 0, which does not depend on A < Ao, n>nQ 
and e, such that for any vertex V\ with a neighbor u 2 of degree d(v 2 ) > 

(10/A) 2+<5 for some 5 € (0, 1), and T = n Cl the probability that contains 
a hot star is bounded below by p\\. 

Next we will show that if we start with one lit star, then after time 
exp(n £//2 ) at least 3/4's of the stars will be lit. 

Lemma 3.3. Let t be the set of stars which are lit at time t in the 
contact process on G n such that |I £ | = 1. Then for T' = exp(n £//2 ) 

P{\F n>T ,\ < (3/4)|^|) < 7exp(-AV/ 3 /16). 

Proof. Let s\ be the lit star at time 0. As seen in Proposition 1, si 
remains lit at time T" = exp(n £//2 ) with probability > 1 — 6exp(— A 2 n £//3 /16) 
for large n. With probability > Cn~ h another star gets hot within time 
t n = 2n £ / 3 and remains lit at time T 1 . Using similar argument as in Lemma 
2.5, the process fails to make (3/4)|V^| many stars lit by time T' with 
probability 

<(3/4)\V*\(l-Cn- b ) T ' /tn 

< (3/4)|T/ e |exp(-C7n- 6 T7t n ) < exp(-AV /3 /16), 

as \V£\ = Cre 1- ( a-1 ) £ and 1 — x < e~ x . So combining with the earlier error 
probability we get the result. □ 

Now we are almost ready to prove our main result. However, we need one 
more lemma that we will use in the proof of the theorem. 

Lemma 3.4. Let F and G be two events which involve exposing n s many 
vertices starting at 1 and 2 respectively for some < 5 < 1/8. Then 

\P{F n G) - P{F)P{G)\ < Cn~^ 4 - 5 l 
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Proof. Let R\ and R2 be the clusters for exposing n many vertices 
starting from 1 and 2 respectively, and let A be the event that they intersect. 
Clearly 



We estimate P(A) using Lemma 3.1. □ 

Lemma 3.4 shows that two events which involve exposing at most n 1 / 8 
vertices starting from two different vertices are asymptotically uncorrelated. 
Now we give the proof of the main theorem. 

Proof of Theorem 1. Given 5 > 0, choose e = min{<5/a, l/20(a — 1)}. Let 
A n be the set of vertices in G n with a neighbor of degree at least (10/A) 2+5 . 
Clearly |^4 n |/n — > co(A/10)( 2+5 ^ a-2 ) as n — > 00 for some constant cq. Define 
the random variables Y x , x G A n as Y x = 1 if the dual contact process starting 
from x can light up a star of V £ and otherwise. By Proposition 2, EY X > 
p±X for some constant pi> and for any x G A n . 

If we grow the cluster starting from x £ A n and exposing one vertex at a 
time, we can find a star on any step with probability at least cn-( Q - 2 ) £ . So 
with probability 1 — exp(— cn e ), we can find a star of within the exposure 
of at most n a£ vertices. So, with high probability, lighting a star up is an 
event involving at most n^ a+1 ^ £ many vertices. As (a + l)e < 1/8, using 
Lemma 3.4, we can say 



P(F n G) < P(A) + P(F nGn A c ) 

= P(A) + P(F n A C )P(G n A c ) 
<P(A) + P(F)P(G). 



Using similar argument for F c and G we get 



\P(F n G) - P(F)P(G)\ < P(A). 



P(Y X = 1, Y z = 1) - P(Y X = l)P(Y z = 1) 

< (1 - exp(-cn £ ))Cn-^ 4 - (a+1 ^ + exp(-cn e ) = 9, 



Using our bound on the covariances 



var 




and Chebyshev's inequality gives 



P 








as n 
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for any 7 > 0, since 6 n — ► as n — > 00. Since EY X > p±X and |^4 n |/n — > 
c (A/10)( 2+5 )( Q - 2 ), if we take p z =piA • c (A/10)( 2+5 )( Q - 2 )/2 then 

(3.2) lim p( V Y^np,) =1. 

Now if Y x = 1, Proposition 2 says that the dual process starting from x 
makes a star hot after T\ = n Cl units of time. Then by Lemma 3.3 within 
next T2 = exp(n e / 2 ) units of time the dual process lights up 75% of all the 
stars with probability 1 — 7exp(— A 2 n £//3 /16). 

Let 1!^ t be the set of stars which are lit at time t in the contact process 
U< : t > 0} and 

T 3 = inf{t > exp(n 1 — ) : \I^ t \ > (3/4)|V^|}. 

By Proposition 1, P(T^ < 00) > 1 — exp(— cn £ ). Let 

5 = {Sc{l,2,... l n}:^ = S=> |J* t | > (3/4)|Kf |}. 

Using the Markov property and self-duality of the contact process we get the 
following inequality. For any subset B of the vertex set, and for the event 
F n = [T3 < 00] we have 

= E P ^t 1+ t 2 3 B)P(& 3 = S\F n )P(F n ) 

= E n^0Vxe P)P(4 = 5|P re )P(P n ) 

> E p (\&t 2 n ^,t 3 I > (3/4)|x^| Vx e P)P(4 = 5|F n )P(P„) 

> P(Y X . = 1 Vx G P)(l - 7|P| exp(-AV /3 /16))P(F n ) 

> P(Y X = 1 Vx e P)(l - 2exp(-cn £/4 )), 

as |JB| < n and P{F n ) > 1 — exp(— cn e ). Hence for T = Ti + T2 + T3, combin- 
ing with (3.2) and using the attractiveness property of the contact process 
we conclude that as n — ► 00 

t<T V n J \ n 

(3.3) 

>p(&^{x:Y x = 1}, ]T Y x >n Pl 

which completes the proof of Theorem 1, and proves the lower bound in 
Theorem 2. 
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4. Upper bound in Theorem 2. For the upper bound, we will show that 
if the infection starts from a vertex x with no vertex of degree > 1/A 1_<5 
nearby, it has a very small chance to survive. To get the 1 in upper bound 
we need to use the fact that first event in the contact process starting at x 
has to be a birth so we begin with that calculation. 

Let As be the set of vertices of degree > A 5-1 . Define Z x , x G {1, 2, . . . , n} 
as Z x = 1 if the dual contact process {^j x ^ : t > 0} starting from x sur- 
vives for T' = 1/A a_1 units of time, and otherwise. We will show EZ X < 
(j ^i+(a-2)(i-<$) £ Qr some constant C. If T\ is the time for the first event 
in the dual process, then ET\ < 1 and using Markov's inequality P(T\ > 
l/A"" 1 ) < A"" 1 . So if Ti < l/A"" 1 , the first event must be a birth for Z x to 
be 1. So for x € As, 

P(Z x = l)<P(T 1 >l/X a - 1 )+ £ Pi ^- 
<X a ~ l + CX r{a ~ 1] 

i>\ s ~ l 

< A a-l + CA . A (a-2)(l-5) 

For x G A§, let io(A) < CA( a-2 )( 1_ *) be the size-biased probability of having a 
vertex of A<5 in its neighborhood. If d(x) = i, the expected number of vertices 
in a radius m around x is at most % ■ EZ m , where Z m is the total progeny 
up to mth generation of the branching process with offspring distribution 
Qk = (k + l)pk+i/^ ~ ck a ~ l . So the expected number of vertices, which are 
within a distance m = \{a — 1)/S], the smallest integer larger than (a—l)/6, 
from x and belong to As, is 

(l/A) 1 -* 

< J2 Pi-i-EZ m -CX^- 2 ^ KCX^M. 

i=2 

Using Markov's inequality the probability of having at least one vertex of 
A5 within a distance m from x has the same upper bound as above. 
Until we reach A,5, |^| x ^| < Yt where 

Y t ->Yi-l at rate Y t , 

Y t ^Yt + l at rate Y t X- {l/X) 1 ' 5 = Y t X S . 

So Yt jumps at rate Y t (l + X s ) and it jumps to Y t + 1 with probability 
A 5 /(l + X s ) < X s . If Ti < I/A"" 1 , the first event in the dual process ^} x} 
must be a birth for Z x to be 1. Let T2 m is the time of the 2mth event after 
the first event. Then ET2 m <■ 2m/ (1 + X s ) and using Markov's inequality 

P(T 2m > l/A"" 1 ) < CX a ~\ 
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Now if < l/A"" 1 and there is no vertex of Ag within a distance m of x, 
the infection starting at x survives for time T' only if Yt has at least m up 
jumps before hitting 0. If there are < m — 1 up jumps in the first 2m then 
Yt will hit by T2 m , as Yq = 2. The probability of this event is 

< P(P > m), where B ~ Binomial(2m, A* 5 ), 

22m^m<5 <- 22m^ct— 1 



Combining all three error probabilities, for any x £ A^, 

P(Z X = 1) < P(Tj > l/A^ 1 ) + P(T 2m > 1/A^ 1 ) 

Using an argument similar to one at the end of the proof of Theorem 1 



P 



> Try — > as n — > oo 



for any 7 > 0. Since EZ X < CA 1 +( a " 2 )( 1 - 5 ) for all x £ {1,2, . . . , n}, if we take 
p u = 3CA 1+ ( Q - 2 )( 1 - 5 ), then 

P ^ x — n Pv^J ^ ^ as n — > 00. 

So by making C larger in the definition of p u and using the attractiveness 
of the contact process 

mfP(\£\<p u n)^l 

as n — > 00. 

5. Proof of connectivity and diameter. We conclude the paper with the 
proof of Lemma 1.2. We begin with a large deviations result. The fact is 
well-known, but the proof is short so we give it for completeness. 

Lemma 5.1. Let Xi,X%, . . . be i.i.d., nonnegative with mean fj,. If p < n, 
then there is a constant 7 > so that 



P(X 1 + --- + X k <pk)<e-^ k . 

Proof. Let <p{9) = Ee~ 9X . If 9 > then 

e- epk P(X 1 + ... + X k <pk)< ( ( } (e) k . 
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So we have 

P(X 1 + --- + X k <pk)< exp(k{9p + log (f>(9)}). 
log (0(0)) = and as 9 -> 

So log 4>(9) ~ —p9 as 9 — > 0, and the result follows by taking small. □ 

Proof of Lemma 1.2. We will prove the result in the following steps: 

Step 1. Let fc„ = (logn) 2 . The size of the cluster C x , starting from x £ 
{1, 2, . . . , n}, reaches size fc n with probability 1 — o(n ). 

Step 2. There is a B < oo so that if the size of C x reaches size 5 logn, it 
will reach n 2//3 with probability 1 — 0(n~ 2 ). 

Step 3. Let £ > 0. Two clusters C x and Cy, starting from x and y respec- 
tively, of size n^ 1 / 2 )"^ will intersect with probability 1 — o(n~ 2 ). 

Steps 2 and 3 follow from the proof of Theorem 3.2.2 of Durrett (2007), 
so it is enough to do Step 1. Before doing this, note that if d\, . . . , d n are the 
degrees of the vertices, and rj > then as n — > oo, 

P( max di > n^+^/^-i)) < n .p(d 1 > „(i+»7)/(«-i)) ^ Cn"' 7 . 

Given a > 3, we choose r/ > small enough so that (1 + r))/(a — 1) < 1/2. 

To prove Step 1, we will expose one vertex at a time. Following the no- 
tation of Durrett (2007), suppose A t ,Ut and Rt are the sets of active, un- 
explored and removed sites respectively at time t in the process of grow- 
ing the cluster starting from 1, with Rq = {1}, Aq = {z: 1 ~ z} and Uq = 
{1, 2, . . . , rt} — Aq U Rq. At time r = inf{i : At = 0} the process stops. If 
At 7^ 0, pick zt from ylj in some way measurable with respect to the process 
up to that time and let 

R t +i = R t U {i t }, 

A t+1 = A t U {z £ U t : k ~ z} ~ {**}, 
Ut+i = C/t-{«GE/i:tt~«}. 

Here = t + 1 for i < r and so Ci = r + 1. If there were no collisions, then 
\At+\ | = \ At\ — 1 + Z where Z has the size biased degree distribution q. Let q v 
be the distribution of [Z \ Z < ji^+wA™ - i)). Then on the event {maxjdj < 
n( 1+J? '/( a-1 '}, | At | is dominated by a random walk S"t = So + Z\ + • • • + Zt, 
where So = Aq and Zi ~ q v . Since g^-i = kp^/p, we have go = Qi = and 
hence q^ = qj = 0. Then St increases monotonically. 
If we let T = inf{?n : S m > A; n } then 

(5.1) P(\d\ < k n ) < P(S t - \A t \ > 4 for some t < T). 
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As observed above, if n is large, all of the vertices have degree < nP where 
(3 = (1 + rj)/(a — 1) < 1/2. As long as St < 2k n , each time we add a new 
vertex and the probability that it is in the active set is at most 

for large n. Thus the probability of two or more collisions while St < 2k n is 
<(2fc n ) 2 7 2 = (O. 

If St — St-i < k n , then the previous argument suffices, but St — St-i 
might be as large as n@ . Letting m > 1/(1 — 2/?), we see that the probability 
of m or more collisions is at most 

(n f3 ) m {Cn /3 ~ 1 ) m = o(n~ 1 ). 

To grow the cluster we will use a breadth first search: we will expose all 
the vertices at distance 1 from the starting point, then those at distance 2, 
etc. When a collision occurs, we do not add a vertex, and we delete the one 
with which a collision has occurred, so two are lost. There is at most one 
collision while St < 2k n . Since 5o > 3, it is easy to see that the worst thing 
that can happen in terms of the growth of the cluster is for the collision to 
occur on the first step, reducing So to 1. After this the number of vertices 
doubles at each step so size k n is reached before we have gone a distance 
log 2 k n from the starting point. 

In the final step we might have a jump S T — SV- 1 > k n and m collisions, 
but as long as k n = (logn) 2 > 2m we do not lose any ground. In the growth 
before time T, each vertex, except for possibly one collision, has added two 
new vertices to the active set. From this it is easy to see that the number of 
vertices in the active set is at least k n /2 — 2m. 

To grow the graph now, we will expose all of the vertices in the current 
active set, then expose all of the neighbors of these vertices, etc. Let e > 0. 
The proof of Theorem 3.2.2 in Durrett (2007) shows (see page 78) that 
if 5 is small then until n8 vertices have been exposed, the cluster growth 
dominates a random walk with mean v — e. Let J%, J 2 , ... be the successive 
sizes of the active set when these phases are complete. The large deviations 
result, Lemma 5.1, implies that there is a 7 > so that 

P(Ji+i <{v- 2s) Ji I Ji =ji) < exp(-jji). 

Since J± > (logn) 2 /2 — 8, it follows from this result that with probability 
> 1 — o(n~ 1 ), in at most 

1 \ log n 



,2 Vlog(^-e) 

steps, the active set will grow to size n^ 1 / 2 ^ +< '. Using the result from Step 
3 and noting that the initial phase of the growth has diameter < log 2 k n = 
O (log logn) the desired result follows. □ 
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